AITopics | matthews correlation coefficient

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Appendix for the Paper: "SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks"

Neural Information Processing Systems | Aug-19-2025, 01:06:43 GMT

This Appendix contains the following additional material. In Section A we report more statistics regarding the considered datasets. In Section E we show the overhead (in terms of training time) incurred by our method. Table 1: Dataset statistics; this table is taken from Yehudai et al. [13] and Bevilacqua et al. [1]. In Figure 1 we show the continuation of Figure 2 (b) from the main paper, including plots on all datasets.

artificial intelligence, machine learning, regularization, (15 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Italy (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Deep Learning for Glioblastoma Morpho-pathological Features Identification: A BraTS-Pathology Challenge Solution

Zhang, Juexin, Weng, Ying, Chen, Ke

arXiv.org Artificial Intelligence | Jul-25-2025

Glioblastoma, a highly aggressive brain tumor with diverse molecular and pathological features, poses a diagnostic challenge due to its heterogeneity. Accurate diagnosis and assessment of this heterogeneity are essential for choosing the right treatment and improving patient outcomes. Traditional methods rely on identifying specific features in tissue samples, but deep learning offers a promising approach for improved glioblastoma diagnosis. In this paper, we present our approach to the BraTS-Path Challenge 2024. We leverage a pre-trained model and fine-tune it on the BraTS-Path training dataset. Our model demonstrates poor performance on the challenging BraTS-Path validation set, as assessed by the Synapse online platform. The model achieves an accuracy of 0.392229, a recall of 0.392229, and an F1-score of 0.392229, indicating a consistent ability to correctly identify instances under the target condition. Notably, our model exhibits a high specificity of 0.898704, showing a strong capacity to correctly classify negative cases. Moreover, a Matthews Correlation Coefficient (MCC) of 0.255267 indicates a limited positive correlation between predicted and actual values. Our solution also achieved second place during the testing phase.
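The identical accuracy, recall, and F1 values reported in this abstract are what one would see if the metrics were micro-averaged over a single-label multi-class task. The abstract does not state the averaging scheme, so that reading is an assumption. A minimal sketch with made-up labels (not the BraTS-Path data) shows why the three numbers coincide in that setting:

```python
def micro_metrics(y_true, y_pred):
    """Return (accuracy, micro_precision, micro_recall, micro_f1).

    Counts are pooled over all classes (micro-averaging). The labels
    passed in below are invented purely for illustration.
    """
    labels = set(y_true) | set(y_pred)
    tp = fp = fn = 0
    for c in labels:
        for t, p in zip(y_true, y_pred):
            if p == c and t == c:
                tp += 1          # predicted c, truly c
            elif p == c and t != c:
                fp += 1          # predicted c, truly something else
            elif p != c and t == c:
                fn += 1          # truly c, predicted something else
    accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1


# Hypothetical three-class example: 4 of 8 predictions are correct.
y_true = ["a", "b", "c", "a", "b", "c", "a", "a"]
y_pred = ["a", "c", "c", "b", "b", "a", "a", "c"]
acc, prec, rec, f1 = micro_metrics(y_true, y_pred)
print(acc, prec, rec, f1)
```

Every misclassified sample counts once as a false positive (for the predicted class) and once as a false negative (for the true class), so the pooled precision and recall both reduce to the fraction of correct predictions.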

artificial intelligence, deep learning, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2507.18133

Country: Asia > China (0.16)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Brain Cancer (0.97)
Health & Medicine > Therapeutic Area > Oncology > Childhood Cancer (0.87)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.70)

Interactive Classification Metrics: A graphical application to build robust intuition for classification model evaluation

Brown, David H., Chicco, Davide

arXiv.org Machine Learning | Dec-22-2024

Machine learning continues to grow in popularity in academia and industry, and is increasingly used in other fields. However, most of the common metrics used to evaluate even simple binary classification models have shortcomings that are neither immediately obvious nor consistently taught to practitioners. Here we present Interactive Classification Metrics (ICM), an application to visualize and explore the relationships between different evaluation metrics. The user changes the distribution statistics and explores corresponding changes across a suite of evaluation metrics. The interactive, graphical nature of this tool emphasizes the tradeoffs of each metric without the overhead of data wrangling and model training. The goals of this application are: (1) to aid practitioners in the ever-expanding machine learning field to choose the most appropriate evaluation metrics for their classification problem; (2) to promote careful attention to interpretation that is required even in the simplest scenarios like binary classification. Our application is publicly available for free under the MIT license as a Python package on PyPI at https://pypi.org/project/interactive-classification-metrics and on GitHub at https://github.com/davhbrown/interactive_classification_metrics.
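The kind of tradeoff this abstract describes can be sketched without the application itself. The snippet below does not use the ICM package or its API; it is a stand-alone illustration with hypothetical confusion-matrix counts, holding sensitivity and specificity fixed at 0.8 while the class balance changes:

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Compute common binary metrics from confusion-matrix counts."""
    total = tp + fp + fn + tn
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": (tp + tn) / total,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        "specificity": tn / (tn + fp) if tn + fp else 0.0,
        "f1": 2 * tp / (2 * tp + fp + fn) if 2 * tp + fp + fn else 0.0,
        # MCC is conventionally set to 0 when any marginal is empty.
        "mcc": (tp * tn - fp * fn) / denom if denom else 0.0,
    }


# Hypothetical counts: 50 positives / 50 negatives.
balanced = binary_metrics(tp=40, fp=10, fn=10, tn=40)
# Hypothetical counts: 900 positives / 100 negatives, same per-class rates.
imbalanced = binary_metrics(tp=720, fp=20, fn=180, tn=80)

for name, m in (("balanced", balanced), ("imbalanced", imbalanced)):
    print(name, {k: round(v, 3) for k, v in m.items()})
# balanced:   accuracy 0.8, F1 0.8,    MCC 0.6
# imbalanced: accuracy 0.8, F1 ~0.878, MCC ~0.41
```

With identical per-class behaviour, accuracy stays put, F1 rises, and MCC falls as positives come to dominate, which is the sort of divergence between metrics that a practitioner would want to notice before choosing one.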

application, artificial intelligence, machine learning, (12 more...)

arXiv.org Machine Learning

2412.17066

Country:

North America > Canada > Ontario > Toronto (0.15)
North America > United States > Texas > Travis County > Austin (0.04)
Europe > Italy > Lombardy > Milan (0.04)

Genre: Research Report > Experimental Study (0.48)

Industry:

Health & Medicine (0.70)
Government > Regional Government (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Empirical Analysis of Efficient Fine-Tuning Methods for Large Pre-Trained Language Models

Doering, Nigel, Gorlla, Cyril, Tuttle, Trevor, Vijay, Adhvaith

arXiv.org Artificial Intelligence | Jan-8-2024

Fine-tuning large pre-trained language models for downstream tasks remains a critical challenge in natural language processing. This paper presents an empirical analysis comparing two efficient fine-tuning methods (BitFit and adapter modules) to standard full model fine-tuning. Experiments conducted on GLUE benchmark datasets (MRPC, CoLA, STS-B) reveal several key insights. The BitFit approach, which trains only bias terms and task heads, matches full fine-tuning performance across varying amounts of training data and time constraints. It demonstrates remarkable stability even with only 30% of data, outperforming full fine-tuning at intermediate data levels. Adapter modules exhibit high variability, with inconsistent gains over default models. The findings indicate BitFit offers an attractive balance between performance and parameter efficiency. Our work provides valuable perspectives on model tuning, emphasizing robustness and highlighting BitFit as a promising alternative for resource-constrained or streaming task settings. The analysis offers actionable guidelines for efficient adaptation of large pre-trained models, while illustrating open challenges in stabilizing techniques like adapter modules.
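To make "trains only bias terms and task heads" concrete, here is a minimal, framework-free sketch of the parameter selection. The parameter names, shapes, and the `classifier.` head prefix are all hypothetical and are not taken from the paper; the point is only how small the trainable subset becomes:

```python
def bitfit_trainable(param_shapes, head_prefix="classifier."):
    """Names of parameters a BitFit-style setup would update.

    Selects every bias term plus everything in the task head. The
    naming convention (".bias" suffix, head prefix) is an assumption.
    """
    return {
        name
        for name in param_shapes
        if name.endswith(".bias") or name.startswith(head_prefix)
    }


def numel(shape):
    """Number of scalar entries in a tensor of the given shape."""
    n = 1
    for dim in shape:
        n *= dim
    return n


# Hypothetical one-layer encoder with a two-class head.
param_shapes = {
    "encoder.layer0.attention.weight": (768, 768),
    "encoder.layer0.attention.bias": (768,),
    "encoder.layer0.ffn.weight": (3072, 768),
    "encoder.layer0.ffn.bias": (3072,),
    "classifier.weight": (2, 768),
    "classifier.bias": (2,),
}

trainable = bitfit_trainable(param_shapes)
n_trainable = sum(numel(param_shapes[name]) for name in trainable)
n_total = sum(numel(shape) for shape in param_shapes.values())
fraction = n_trainable / n_total
print(sorted(trainable), n_trainable, n_total, round(fraction, 5))
```

In a PyTorch model the same selection would typically be applied by iterating over `model.named_parameters()` and setting `requires_grad` accordingly; that wiring is left out here to keep the sketch dependency-free.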

dataset, fine-tuning, language model, (14 more...)

arXiv.org Artificial Intelligence

2401.04051

Country:

North America > United States > California > San Diego County > San Diego (0.04)
North America > United States > California > San Diego County > La Jolla (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
Africa > Rwanda > Kigali > Kigali (0.04)

Genre: Research Report > New Finding (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Algorithms for automatic intents extraction and utterances classification for goal-oriented dialogue systems

Legashev, Leonid, Shukhman, Alexander, Zhigalov, Arthur

arXiv.org Artificial Intelligence | Dec-15-2023

Modern machine learning techniques in the natural language processing domain can be used to automatically generate scripts for goal-oriented dialogue systems. The current article presents a general framework for studying the automatic generation of scripts for goal-oriented dialogue systems. A method for preprocessing dialog data sets in JSON format is described. A comparison is made of two methods for extracting user intent based on BERTopic and latent Dirichlet allocation. A comparison has been made of two implemented algorithms for classifying statements of users of a goal-oriented dialogue system based on logistic regression and BERT transformer models. The BERT transformer approach using the bert-base-uncased model showed better results for the three metrics Precision (0.80), F1-score (0.78) and Matthews correlation coefficient (0.74) in comparison with other methods.

arxiv preprint arxiv, goal-oriented dialogue system, intent extraction and utterance classification, (8 more...)

arXiv.org Artificial Intelligence

2312.09658

Country:

Europe > Russia > Volga Federal District > Orenburg Oblast > Orenburg (0.05)
Asia > Russia (0.04)

Genre: Research Report (0.91)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.35)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.34)

Does the evaluation stand up to evaluation? A first-principle approach to the evaluation of classifiers

Dyrland, K., Lundervold, A. S., Mana, P. G. L. Porta

arXiv.org Artificial Intelligence | Feb-21-2023

How can one meaningfully make a measurement, if the meter does not conform to any standard and its scale expands or shrinks depending on what is measured? In the present work it is argued that current evaluation practices for machine-learning classifiers are affected by this kind of problem, leading to negative consequences when classifiers are put to real use; consequences that could have been avoided. It is proposed that evaluation be grounded on Decision Theory, and the implications of such foundation are explored. The main result is that every evaluation metric must be a linear combination of confusion-matrix elements, with coefficients ("utilities") that depend on the specific classification problem. For binary classification, the space of such possible metrics is effectively two-dimensional. It is shown that popular metrics such as precision, balanced accuracy, Matthews Correlation Coefficient, Fowlkes-Mallows index, F1-measure, and Area Under the Curve are never optimal: they always give rise to an in-principle avoidable fraction of incorrect evaluations. This fraction is even larger than would be caused by the use of a decision-theoretic metric with moderately wrong coefficients.
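A metric that is a linear combination of confusion-matrix elements can be sketched in a few lines. The utility values and the two confusion matrices below are hypothetical, chosen only to show that a problem-specific utility can reverse the ranking that accuracy gives; they are not taken from the paper:

```python
def expected_utility(confusion, utilities):
    """Average utility per sample: sum_ij U[i][j] * C[i][j] / N.

    confusion[i][j] counts samples of true class i predicted as class j;
    utilities[i][j] is the (problem-specific) payoff of that outcome.
    """
    total = sum(sum(row) for row in confusion)
    weighted = sum(
        u * c
        for u_row, c_row in zip(utilities, confusion)
        for u, c in zip(u_row, c_row)
    )
    return weighted / total


# Accuracy is the special case with utility 1 on the diagonal, 0 elsewhere.
ACCURACY = [[1, 0],
            [0, 1]]
# Hypothetical utilities: a missed positive (true 1, predicted 0) costs 10.
COSTLY_MISS = [[1, 0],
               [-10, 1]]

# Hypothetical classifiers evaluated on 90 negatives and 10 positives.
clf_a = [[90, 0],
         [5, 5]]    # never raises a false alarm, misses half the positives
clf_b = [[80, 10],
         [0, 10]]   # raises false alarms, misses no positives

print(expected_utility(clf_a, ACCURACY), expected_utility(clf_b, ACCURACY))
print(expected_utility(clf_a, COSTLY_MISS), expected_utility(clf_b, COSTLY_MISS))
```

Under the accuracy utilities classifier A scores 0.95 against 0.90 for B; under the costly-miss utilities A drops to 0.45 while B stays at 0.90, so the preferred classifier depends on the coefficients rather than on the confusion matrix alone.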

artificial intelligence, machine learning, matrix, (16 more...)

arXiv.org Artificial Intelligence

2302.12006

Country:

North America > United States (0.47)
Europe (0.46)

Genre:

Research Report (0.82)
Instructional Material (0.67)

Industry: Health & Medicine > Diagnostic Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Goodness of Fit Metrics for Multi-class Predictor

Itai, Uri, Katz, Natan

arXiv.org Artificial Intelligence | Aug-11-2022

Multi-class prediction has gained popularity over recent years. Measuring goodness of fit has thus become a cardinal question that researchers often have to deal with. Several metrics are commonly used for this task. However, when deciding on the right measurement, one must consider that different use cases impose different constraints that govern this decision. A leading constraint, at least in real-world multi-class problems, is imbalanced data: multi-categorical problems hardly ever provide symmetrical data. Hence, when we observe common KPIs (key performance indicators), e.g., Precision-Sensitivity or Accuracy, one can seldom relate the obtained numbers to the model's actual needs. We suggest generalizing the Matthews correlation coefficient into multi-dimensions. This generalization is based on a geometrical interpretation of the generalized confusion matrix.
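For orientation, the snippet below computes the commonly used multi-class form of MCC (Gorodkin's R_K statistic) from a K x K confusion matrix. This is not necessarily the geometric generalization the paper proposes, whose details are not given in the abstract; the confusion matrices are hypothetical:

```python
import math


def multiclass_mcc(confusion):
    """Multi-class MCC (R_K) from a square confusion matrix.

    confusion[i][j] counts samples of true class i predicted as class j.
    MCC = (c*s - sum_k p_k*t_k) / sqrt((s^2 - sum p_k^2)(s^2 - sum t_k^2))
    with c = correct predictions, s = samples, t_k = true count of
    class k, p_k = predicted count of class k.
    """
    k = len(confusion)
    s = sum(sum(row) for row in confusion)
    c = sum(confusion[i][i] for i in range(k))
    t = [sum(confusion[i]) for i in range(k)]
    p = [sum(confusion[i][j] for i in range(k)) for j in range(k)]
    numerator = c * s - sum(pk * tk for pk, tk in zip(p, t))
    denominator = math.sqrt(
        (s * s - sum(pk * pk for pk in p)) * (s * s - sum(tk * tk for tk in t))
    )
    # Conventionally 0 when one class absorbs all predictions or all labels.
    return numerator / denominator if denominator else 0.0


perfect = [[5, 0, 0],
           [0, 5, 0],
           [0, 0, 5]]
# Imbalanced data, classifier always predicts the majority class.
majority_only = [[90, 0, 0],
                 [5, 0, 0],
                 [5, 0, 0]]

print(multiclass_mcc(perfect))        # 1.0
print(multiclass_mcc(majority_only))  # 0.0, although accuracy is 0.90
```

The second case illustrates the imbalance problem raised in the abstract: accuracy rewards the majority-only classifier with 0.90, while the correlation-based score reports no predictive signal.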

matrix, matthews correlation coefficient, metric, (14 more...)

arXiv.org Artificial Intelligence

2208.05651

Country: North America > United States (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)

Evaluation metrics: leave your comfort zone and try MCC and Brier Score

#artificialintelligence | Jan-8-2022, 15:25:41 GMT

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from noisy, structured and unstructured data, and apply knowledge and actionable insights from data across a broad range of application domains. Machine learning, instead, is the study of computer algorithms that can improve automatically through experience and by the use of data. It is seen as a part of artificial intelligence. Data scientists around the world apply machine learning in order to build models able to predict future events, cluster people/objects into similar groups and also identify unexpected anomalies. Every enthusiastic data scientist knows that the most exciting part of machine learning is to choose the coolest algorithm capable of solving the problem of interest (Supervised or Unsupervised).

brier score, evaluation metric, prediction, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Extracting Angina Symptoms from Clinical Notes Using Pre-Trained Transformer Architectures

Eisman, Aaron S., Shah, Nishant R., Eickhoff, Carsten, Zerveas, George, Chen, Elizabeth S., Wu, Wen-Chih, Sarkar, Indra Neil

arXiv.org Artificial Intelligence | Oct-12-2020

Anginal symptoms can connote increased cardiac risk and a need for change in cardiovascular management. This study evaluated the potential to extract these symptoms from physician notes using the Bidirectional Encoder Representations from Transformers (BERT) language model fine-tuned on a domain-specific corpus. The history of present illness sections of 459 expert-annotated primary care physician notes from consecutive patients referred for cardiac testing without known atherosclerotic cardiovascular disease were included. Notes were annotated for positive and negative mentions of chest pain and shortness of breath characterization. The results demonstrate high sensitivity and specificity for the detection of chest pain or discomfort, substernal chest pain, shortness of breath, and dyspnea on exertion. Small sample size limited extracting factors related to provocation and palliation of chest pain. This study provides a promising starting point for the natural language processing of physician notes to characterize clinically actionable anginal symptoms. Introduction: Angina pectoris is a constellation of symptoms that portends inadequate oxygenation of cardiac muscle due to either a decrease in coronary blood supply, an increase in myocardial oxygen demand, or both.

chest pain, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2010.05757

Country:

North America > United States > Rhode Island > Providence County > Providence (0.05)
Asia > Middle East > Israel (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
